High - level Vision as Statistical Inference
نویسنده
چکیده
Human vision is remarkably versatile and reliable, despite the fact that retinal image information is noisy, ambiguous, and confounds the properties of objects that are useful. By treating vision as a problem of statistical inference, three classes of constraints can be identified: the visual task, prior knowledge of scene structure independent of the image, and the relationship between image structure and task requirements. By considering the visual system as an organ for statistical inference, we can test whether and how it uses these constraints. This strategy is illustrated for two high-level visual functions: depth-from-cast-shadows and viewpoint compensation in 3-D object recognition. An object’s relative depth can be determined from its cast shadow, even when local image information doesn’t uniquely specify shadow edges, and global information doesn’t determine where the light source is. What information enables a unique estimate of depth from shadows? This chapter shows how the visual task, prior assumptions on light movement and material properties, and local image cues constrain the perception of depth from shadows. A 3D object can be recognized from views never seen before, despite the fact that depth information about shape is lost due to projection on to the retina. How does human recognition compensate for variations in viewpoint? By designing a simple recognition task for which optimal statistical decisions are computable, human performance can be normalized with respect to the information in the task, leaving remaining differences diagnostic of brain mechanisms.
منابع مشابه
Composite Statistical Learning and Inference for Semantic Segmentation
In this paper we present a learning and inference framework, Composite Statistical Learning and Inference (CSLI), for random fields with extremely high order interactions. Instead of conventional probabilistic approaches that build models on clique potentials, we propose to focus on subset statistics from overlapping random variable subsets and employ composite likelihood approaches for learnin...
متن کاملBayesian Statistical Methods for Audio and Music Processing
Bayesian statistical methods provide a formalism for arriving at solutions to various problems faced in audio processing. In real environments, acoustical conditions and sound sources are highly variable, yet audio signals often possess significant statistical structure. There is a great deal of prior knowledge available about why this statistical structure is present. This includes knowledge o...
متن کاملStatistical Approaches to Inferring Object Shape from Single Images
Depth inference is a fundamental problem of computer vision with a broad range of potential applications. Monocular depth inference techniques, particularly shape from shading dates back to as early as the 40's when it was rst used to study the shape of the lunar surface. Since then there has been ample research to develop depth inference algorithms using monocular cues. Most of these are based...
متن کاملStatistical Inference in Autoregressive Models with Non-negative Residuals
Normal residual is one of the usual assumptions of autoregressive models but in practice sometimes we are faced with non-negative residuals case. In this paper we consider some autoregressive models with non-negative residuals as competing models and we have derived the maximum likelihood estimators of parameters based on the modified approach and EM algorithm for the competing models. Also,...
متن کاملMarkov Random Field modeling, inference & learning in computer vision & image understanding: A survey
In this paper, we present a comprehensive survey of Markov Random Fields (MRFs) in computer vision and image understanding, with respect to the modeling, the inference and the learning. While MRFs were introduced into the computer vision field about two decades ago, they started to become a ubiquitous tool for solving visual perception problems around the turn of the millennium following the em...
متن کامل